HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism
نویسندگان
چکیده
Recently, some challenging tasks in multi-agent systems have been solved by hierarchical reinforcement learning methods. Inspired the intra-level and inter-level coordination human nervous system, we propose a novel value decomposition framework HAVEN based on for fully cooperative problems. To address instability arising from concurrent optimization of policies between various levels agents, introduce dual mechanism inter-agent strategies designing reward functions two-level hierarchy. does not require domain knowledge pre-training, can be applied to any variant. Our method achieves desirable results different decentralized partially observable Markov decision process domains outperforms other popular algorithms.
منابع مشابه
Reinforcement Learning in Cooperative Multi–Agent Systems
Reinforcement Learning is used in cooperative multi–agent systems differently for various problems. We provide a review on learning algorithms used for repeated common–payoff games, and stochastic general– sum games. Then these learning algorithms is compared with another algorithm for the credit assignment problem that attempts to correctly assign agents the awards that they deserve.
متن کاملFederated Control with Hierarchical Multi-Agent Deep Reinforcement Learning
We present a framework combining hierarchical and multi-agent deep reinforcement learning approaches to solve coordination problems among a multitude of agents using a semi-decentralized model. The framework extends the multi-agent learning setup by introducing a meta-controller that guides the communication between agent pairs, enabling agents to focus on communicating with only one other agen...
متن کاملBaselines for Joint-Action Reinforcement Learning of Coordination in Cooperative Multi-agent Systems
We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. Specifically, we focus on a novel action selection strategy for Q-learning (Watkins 1989). The new technique is applicable to scenarios where mutual observation of actions is not possible. To date, reinforcement learning approaches for such independent agents di...
متن کاملImproving on the reinforcement learning of coordination in cooperative multi-agent systems
We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multiagent systems. These techniques are variants of Q-learning (Watkins, 1989) that are applicable to scenarios where mutual observation of actions is not possible. To date, reinforcement learning approaches for such independent agents did not guarantee convergence to the optimal ...
متن کاملCooperative Multi-agent Control Using Deep Reinforcement Learning
This work considers the problem of learning cooperative policies in complex, partially observable domains without explicit communication. We extend three classes of single-agent deep reinforcement learning algorithms based on policy gradient, temporal-difference error, and actor-critic methods to cooperative multi-agent systems. We introduce a set of cooperative control tasks that includes task...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i10.26386